Correlation Analysis of Binary Similarity and Distance Measures on Different Binary Database Types
نویسندگان
چکیده
Binary similarity and dissimilarity measures are of great importance to pattern recognition and other fields. Here, correlations between pairs of 76 binary similarity and distance measures are studied. Some similarity measures are highly correlated while others are not, and the variability of the correlation can depend on the characteristics of the underlying binary data. To better understand the variation of the correlations, we define three basic types of binary databases. The variations of the correlations on these database types are statistically analyzed, and database variant and invariant correlations are identified. In addition to common linear correlation patterns between measures, numerous unusual and interesting correlation patterns are also presented.
منابع مشابه
A Survey of Binary Similarity and Distance Measures
The binary feature vector is one of the most common representations of patterns and measuring similarity and distance measures play a critical role in many problems such as clustering, classification, etc. Ever since Jaccard proposed a similarity measure to classify ecological species in 1901, numerous binary similarity and distance measures have been proposed in various fields. Applying approp...
متن کاملEulerian Lagrangian Simulation of Particle Capture and Dendrite Formation on Binary Fibers
The capture efficiency of the small aerosol particle is strongly influenced by the structure of fibrous layers. This study presents particle deposition and dendrite formation on different arrangements of binary fibers. 2-D numerical simulation is performed using the open source software of OpenFOAM. In the instantaneous filtration of a single fiber, obtained results are in good agreement with th...
متن کاملDyVSoR: dynamic malware detection based on extracting patterns from value sets of registers
To control the exponential growth of malware files, security analysts pursue dynamic approaches that automatically identify and analyze malicious software samples. Obfuscation and polymorphism employed by malwares make it difficult for signature-based systems to detect sophisticated malware files. The dynamic analysis or run-time behavior provides a better technique to identify the threat. In t...
متن کاملMandibular Trabecular Bone Analysis Using Local Binary Pattern for Osteoporosis Diagnosis
Background: Osteoporosis is a systemic skeletal disease characterized by low bone mineral density (BMD) and micro-architectural deterioration of bone tissue, leading to bone fragility and increased fracture risk. Since Panoramic image is a feasible and relatively routine imaging technique in dentistry; it could provide an opportunistic chance for screening osteoporosis. In this regard, numerous...
متن کاملProcess Mining by Measuring Process Block Similarity
Mining, discovering, and integrating process-oriented services has attracted growing attention in the recent year. Workflow precedence graph and workflow block structures are two important factors for comparing and mining processes based on distance similarity measure. Some existing work has done on comparing workflow designs based on their precedence graphs. However, there lacks of standard di...
متن کامل